A Tree Based Association Rule Approach for XML Data with Semantic Integration

Authors

  • D. Sasikala
  • K. Premalatha
Abstract

The use of eXtensible Markup Language (XML) in web, business and scientific databases has led to the development of methods, techniques and systems to manage and analyze XML data. Semi-structured documents suffer from heterogeneity and high dimensionality. XML structure and content mining represents a convergence of research in semi-structured data and text mining. As the information available on the internet grows drastically, extracting knowledge from XML documents becomes a harder task. Documents are often so large that the data set returned as the answer to a query may itself be too big to convey the required information. To improve query answering, a Semantic Tree Based Association Rule (STAR) mining method is proposed. This method provides intensional information by considering the structure, the content and the semantics of the content. The method is applied to the Reuters dataset, and the results show that the proposed method performs well.

Keywords: Semi-structured Document, Tree based Association Rule (TAR), Semantic Association Rule Mining.
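As a rough illustration of the idea behind association rules over XML trees, the sketch below computes the support and confidence of a rule "body => head" over a small collection of documents. It is a deliberate simplification and not the STAR method of the paper: each document is reduced to its set of root-to-node tag paths (real tree-based rules work on subtrees and, here, on content semantics as well), and the function names `tag_paths` and `rule_metrics` and the sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Return the set of root-to-node tag paths in one XML document."""
    root = ET.fromstring(xml_text)
    paths = set()
    stack = [(root, root.tag)]
    while stack:
        node, path = stack.pop()
        paths.add(path)
        for child in node:
            stack.append((child, path + "/" + child.tag))
    return paths


def rule_metrics(docs, body, head):
    """Support and confidence of the rule body => head.

    docs: list of XML strings.
    body, head: sets of tag paths; the head is taken to extend the body.
    """
    body = set(body)
    full = body | set(head)
    path_sets = [tag_paths(d) for d in docs]
    n_body = sum(1 for p in path_sets if body <= p)
    n_full = sum(1 for p in path_sets if full <= p)
    support = n_full / len(path_sets) if path_sets else 0.0
    confidence = n_full / n_body if n_body else 0.0
    return support, confidence


# Hypothetical toy collection, for illustration only.
docs = [
    "<article><title/><body><topic/></body></article>",
    "<article><title/><body/></article>",
    "<article><title/><body><topic/></body></article>",
    "<note><title/></note>",
]
support, confidence = rule_metrics(
    docs, {"article/body"}, {"article/body/topic"}
)
print(support, confidence)
```

In this toy collection, three documents contain the body path and two of them also contain the head path, so the rule holds in half of all documents and in two thirds of the documents that match the body.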

Similar resources

An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)

Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...

Semantic Integration of Schema Conforming XML Data Sources

A challenging problem in Web engineering is the integration of XML data sources. Even if these data sources conform to schemas, they may have their schemas and the corresponding XML documents structured differently. In this paper, we address the problem of integrating XML data sources (a) by adding semantic information to document schemas, and (b) by using a query language that allows a partial...

Mining Interesting Clinico-Genomic Associations: The HealthObs Approach

HealthObs is an integrated (Java-based) environment targeting the seamless integration and intelligent processing of distributed and heterogeneous clinical and genomic data. Via the appropriate customization of standard medical and genomic data-models HealthObs achieves the semantic homogenization of remote clinical and gene-expression records, and their uniform XML-based representation. The sy...

A Rule-Based Conversion of XML Document Type Definition to a Conceptual Schema

XML has emerged as a standard for semi-structured and structured data representation and exchange over the Web. Currently, Web-based Information Systems require semantic integration mechanisms for XML data to obtain a global view of heterogeneous XML sources of a given domain. This paper describes a process for converting XML DTDs to conceptual schemata in XCM (XML Conceptual Model), a conceptual mo...

Processing Preference Queries in Standard Database Systems

  • Turkish information retrieval: past changes future
  • From on-campus project organised problem based learning to facilitated work based learning in industry
  • Innovative information and knowledge infrastructures - how do I find what I need?
  • XMask: an enabled XML management system
  • Validation of XML documents: from UML models to XML schemas and XSLT stylesheets
  • A novel c...

Publication date: 2015